60 research outputs found

A Case-Based Approach to Cross Domain Sentiment Classification

Author: A. Aamodt
A. Abbasi
A. Kennedy
G. Salton
G.A. Miller
J. Demšar
P.J. Stone
S. Baccianella
W. Chapman
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012

This paper considers the task of sentiment classification of subjective text across many domains, in particular in scenarios where no in-domain data is available. Motivated by the more general applicability of such methods, we propose an extensible approach to sentiment classification that leverages sentiment lexicons and out-of-domain data to build a case-based system in which solutions to past cases are reused to predict the sentiment of new documents from an unknown domain. In our approach, the case representation uses a set of features based on document statistics, while the case solution stores the sentiment lexicons employed in past predictions, allowing for later retrieval and reuse on similar documents. The case-based nature of our approach also allows for future improvements, since new lexicons and classification methods can be added to the case base as they become available. In a cross-domain experiment, our method has shown robust results when compared to a baseline single-lexicon classifier where the lexicon has to be pre-selected for the domain in question.
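
The retrieve-and-reuse idea in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy lexicons, the two document statistics used as case features, the nearest-neighbour retrieval, and all function names below are assumptions made purely for illustration.

```python
import math

# Hypothetical toy lexicons (word -> polarity score); not taken from the paper.
LEXICONS = {
    "general": {"good": 1.0, "great": 1.0, "bad": -1.0, "poor": -1.0},
    "tech": {"fast": 1.0, "reliable": 1.0, "crash": -1.0, "laggy": -1.0},
}


def features(text):
    """Case description: simple document statistics (an illustrative choice)."""
    tokens = text.lower().split()
    n = len(tokens)
    avg_len = sum(len(t) for t in tokens) / n if n else 0.0
    return (float(n), avg_len)


def lexicon_score(text, lexicon):
    """Sum the polarity scores of all tokens found in the lexicon."""
    return sum(lexicon.get(t, 0.0) for t in text.lower().split())


def build_case_base(labelled_docs):
    """Store, for each past document, the lexicons that predicted its label."""
    cases = []
    for text, label in labelled_docs:
        successful = []
        for name, lexicon in LEXICONS.items():
            score = lexicon_score(text, lexicon)
            if (score > 0 and label == "pos") or (score < 0 and label == "neg"):
                successful.append(name)
        if successful:
            cases.append((features(text), successful))
    return cases


def predict(text, cases, k=1):
    """Retrieve the k most similar cases and reuse their stored lexicons."""
    query = features(text)
    nearest = sorted(cases, key=lambda case: math.dist(query, case[0]))[:k]
    total = sum(
        lexicon_score(text, LEXICONS[name])
        for _, names in nearest
        for name in names
    )
    return "pos" if total > 0 else "neg"


# Out-of-domain labelled documents form the case base.
case_base = build_case_base([
    ("a good film", "pos"),
    ("the phone is fast and reliable", "pos"),
])
print(predict("great plot", case_base))
```

In this sketch a new lexicon is added by extending `LEXICONS` and rebuilding the case base, which mirrors the extensibility the abstract describes only in spirit.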

Crossref

Arrow@TUDublin

SentiBench - a benchmark comparison of state-of-the-practice sentiment analysis methods

In the last few years, thousands of scientific papers have investigated sentiment analysis, several startups that measure opinions on real data have emerged, and a number of innovative products related to this theme have been developed. There are multiple methods for measuring sentiments, including lexical-based and supervised machine learning methods. Despite the vast interest in the theme and the wide popularity of some methods, it is unclear which one is better for identifying the polarity (i.e., positive or negative) of a message. Accordingly, there is a strong need to conduct a thorough apples-to-apples comparison of sentiment analysis methods, as they are used in practice, across multiple datasets originating from different data sources. Such a comparison is key for understanding the potential limitations, advantages, and disadvantages of popular methods. This article aims at filling this gap by presenting a benchmark comparison of twenty-four popular sentiment analysis methods (which we call the state-of-the-practice methods). Our evaluation is based on a benchmark of eighteen labeled datasets, covering messages posted on social networks, movie and product reviews, as well as opinions and comments in news articles. Our results highlight the extent to which the prediction performance of these methods varies across datasets. Aiming at boosting the development of this research area, we open the methods' codes and datasets used in this article, deploying them in a benchmark system, which provides an open API for accessing and comparing sentence-level sentiment analysis methods.

arXiv.org e-Print Archive

Crossref

Springer - Publisher Connector

REPOSITORIO INSTITUCIONAL DA UFOP

Sentiment analysis methods for understanding large-scale texts: a case for using continuum-scored words and word shift graphs

Author: AB Warriner
B Liu
B Pang
C Levallois
C Whissell
CJ Hutto
DJ Hand
E Cambria
EJ Ruiz
F Nielsen
FN Ribeiro
J Bollen
J Si
J-B Michel
JMV Rayner
JW Pennebaker
L Mitchell
M Taboada
M Thelwall
N Pappas
P Gonçalves
PJ Stone
PS Dodds
R Socher
S Baccianella
S Bird
S Kiritchenko
S Poria
SE Alajajian
SM Mohammad
T Smedt De
T Wilson
X Zhu
Y Lin
Publication venue: 'Springer Science and Business Media LLC'

Crossref

SentiHealth: creating health-related sentiment lexicon using hybrid approach

Author: DI Moldovan
E Randeree
FM Kundi
G Fabbrizio
GA Miller
JW Pennebaker
MZ Asghar
N Bayes
O Bodenreider
S Ahmad
S Baccianella
S Bird
TH Belt
Publication venue: 'Springer Science and Business Media LLC'

Crossref

dispel4py: A Python framework for data-intensive scientific computing

Author: Alexander Moreno
Amrey Krause
Baccianella S
Blankenberg D
Buil-Aranda C
Filgueira R
Hey AJG
Iraklis Klampanos
Malcolm Atkinson
MPI Forum
Nielsen FA
Pak A
Rosa Filgueira
Rynge M
Segaran T
Shoshani A
Vahi K
Publication venue: 'SAGE Publications'
Publication date: 01/07/2017

This paper presents dispel4py, a new Python framework for describing abstract stream-based workflows for distributed data-intensive applications. These combine the familiarity of Python programming with the scalability of workflows. Data streaming is used to gain performance, rapid prototyping and applicability to live observations. dispel4py enables scientists to focus on their scientific goals, avoiding distracting details and retaining flexibility over the computing infrastructure they use. The implementation, therefore, has to map dispel4py abstract workflows optimally onto target platforms chosen dynamically. We present four dispel4py mappings: Apache Storm, message-passing interface (MPI), multi-threading and sequential, showing two major benefits: a) smooth transitions from local development on a laptop to scalable execution for production work, and b) scalable enactment on significantly different distributed computing infrastructures. Three application domains are reported and measurements on multiple infrastructures show the optimisations achieved; they have provided demanding real applications and helped us develop effective training. dispel4py.org is an open-source project to which we invite participation. The effective mapping of dispel4py onto multiple target infrastructures demonstrates exploitation of data-intensive and high-performance computing (HPC) architectures and consistent scalability.

Crossref

Heriot Watt Pure

Edinburgh Research Explorer

University of St. Andrews - Pure

A classification-based review recommender

Author: H. Tang
S. Baccianella
Y. Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/12/2009

Paper presented at the Twenty-ninth SGAI International Conference (AI-2009), Cambridge, UK, 15th-17th December 2009. Many online stores encourage their users to submit product/service reviews in order to guide future purchasing decisions. These reviews are often listed alongside product recommendations but, to date, limited attention has been paid as to how best to present these reviews to the end-user. In this paper, we describe a supervised classification approach that is designed to identify and recommend the most helpful product reviews. Using the TripAdvisor service as a case study, we compare the performance of several classification techniques using a range of features derived from hotel reviews. We then describe how these classifiers can be used as the basis for a practical recommender that automatically suggests the most helpful contrasting reviews to end-users. We present an empirical evaluation which shows that our approach achieves a statistically significant improvement over alternative review ranking schemes. Funding: Science Foundation Ireland. Conference details: http://www.bcs-sgai.org/ai2009/?section=hom

Crossref

Research Repository UCD

Irish Universities